
Conversation

Contributor

@lengmo1996 commented Jul 14, 2025

What does this PR do?

Fixes an infinite recursion when loading the pre-trained weights of Transformer2DModel with certain norm_type values.

Fixes # (issue)
PR #7647 maps Transformer2DModel to one of two variants based on norm_type: PixArtTransformer2DModel or DiTTransformer2DModel. However, some models use other norm_type values, such as ada_norm or layer_norm. Loading the pre-trained weights of a pipeline that contains such a model sends the program into endless recursion.

Specifically, calling the pipeline's from_pretrained calls Transformer2DModel.from_pretrained. Since Transformer2DModel inherits from LegacyModelMixin, this resolves to LegacyModelMixin.from_pretrained, which uses _fetch_remapped_cls_from_config to decide whether Transformer2DModel should be remapped to one of the variants. When no remapping applies, the "remapped" class is Transformer2DModel itself, and the call loops forever:

Transformer2DModel.from_pretrained → LegacyModelMixin.from_pretrained → _fetch_remapped_cls_from_config → Transformer2DModel.from_pretrained → …
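For illustration, here is a minimal, self-contained sketch of the call pattern and of the kind of guard that breaks the cycle. This is not the actual diffusers source; the class and function names mirror the ones above, but all bodies are simplified stand-ins:

class ModelMixin:
    @classmethod
    def from_pretrained(cls, path):
        # Stand-in for the real weight-loading logic.
        print(f"loading {cls.__name__} from {path}")
        return cls()


class DiTTransformer2DModel(ModelMixin): ...
class PixArtTransformer2DModel(ModelMixin): ...


def _fetch_remapped_cls_from_config(config, cls):
    # Only these two norm types have dedicated variants; any other value
    # (e.g. "ada_norm" or "layer_norm") yields the original class.
    remap = {
        "ada_norm_zero": DiTTransformer2DModel,
        "ada_norm_single": PixArtTransformer2DModel,
    }
    return remap.get(config.get("norm_type"), cls)


class LegacyModelMixin(ModelMixin):
    @classmethod
    def from_pretrained(cls, path, config=None):
        remapped_cls = _fetch_remapped_cls_from_config(config or {}, cls)
        if remapped_cls is cls:
            # The guard: no remapping is needed, so dispatch straight to
            # the real loader instead of re-entering this method.
            return super().from_pretrained(path)
        # Without the guard above, norm_type="ada_norm" would make this line
        # call LegacyModelMixin.from_pretrained on the same class, forever.
        return remapped_cls.from_pretrained(path)


class Transformer2DModel(LegacyModelMixin): ...


# Terminates with the guard; recurses until RecursionError without it.
Transformer2DModel.from_pretrained("./ckpt", config={"norm_type": "ada_norm"})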

Before submitting

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@yiyixuxu
Collaborator

thanks @lengmo1996
would you be able to provide an example that will trigger the infinite loop (you can create a dummy model)? it will be easier for us to understand the problem

@lengmo1996
Contributor Author


Thanks for your quick reply, @yiyixuxu. Here is a simple example.
If I have a Transformer2DModel configured as follows:

{
  "_class_name": "Transformer2DModel",
  "_diffusers_version": "0.34.0",
  "activation_fn": "geglu-approximate",
  "attention_bias": true,
  "attention_head_dim": 88,
  "cross_attention_dim": 512,
  "dropout": 0.0,
  "in_channels": null,
  "norm_num_groups": 32,
  "num_attention_heads": 16,
  "num_embeds_ada_norm": 100,
  "num_layers": 36,
  "num_vector_embeds": 4097,
  "sample_size": 32,
  "norm_type": "ada_norm"
}

config.json
Please note that norm_type here is neither "ada_norm_zero" nor "ada_norm_single".
When I run the following Python code, I get RecursionError: maximum recursion depth exceeded:

from diffusers import Transformer2DModel

transformer = Transformer2DModel.from_pretrained("./simple_demo")

(I placed config.json in the simple_demo folder)
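For reference, one way to set up the dummy folder is to write the config above to disk. This is a hypothetical helper snippet for convenience, not part of the PR:

import json
from pathlib import Path

# Write the config shown above to ./simple_demo/config.json so that
# Transformer2DModel.from_pretrained("./simple_demo") can find it.
# Note the Python spellings: True/None instead of JSON's true/null.
config = {
    "_class_name": "Transformer2DModel",
    "_diffusers_version": "0.34.0",
    "activation_fn": "geglu-approximate",
    "attention_bias": True,
    "attention_head_dim": 88,
    "cross_attention_dim": 512,
    "dropout": 0.0,
    "in_channels": None,
    "norm_num_groups": 32,
    "num_attention_heads": 16,
    "num_embeds_ada_norm": 100,
    "num_layers": 36,
    "num_vector_embeds": 4097,
    "sample_size": 32,
    "norm_type": "ada_norm",  # neither "ada_norm_zero" nor "ada_norm_single"
}

Path("simple_demo").mkdir(exist_ok=True)
Path("simple_demo/config.json").write_text(json.dumps(config, indent=2))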

Collaborator

@yiyixuxu left a comment


thanks!

@yiyixuxu requested a review from DN6 July 14, 2025 22:35
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@lengmo1996
Contributor Author

I pushed a commit to make the changed files conform to the ruff format requirements; it should pass the check_code_quality job.

Collaborator

@DN6 left a comment


Thanks for catching!

@lengmo1996
Contributor Author

I have checked the details of the failing "LoRA tests with PEFT main" check, and the failure does not seem related to this change: this PR only touches the mapping of Transformer2DModel to its two variants, while the failing test configures pipeline_components with a UNet. I also pulled the main branch and ran the test locally using the diffusers/diffusers-pytorch-cpu:latest image, and the test fails there too, i.e. even without this modification. Since my experience with the test suite is limited, I am not sure how to get this PR to pass all checks. What else can I do?

@yiyixuxu merged commit c5d6e0b into huggingface:main Jul 16, 2025
26 of 28 checks passed
tolgacangoz pushed a commit to tolgacangoz/diffusers that referenced this pull request Jul 17, 2025
…when loading certain pipelines containing Transformer2DModel (huggingface#11923)

* fix a bug about loop call

* fix a bug about loop call

* ruff format

---------

Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>
tolgacangoz pushed a commit to tolgacangoz/diffusers that referenced this pull request Jul 18, 2025
…when loading certain pipelines containing Transformer2DModel (huggingface#11923)

* fix a bug about loop call

* fix a bug about loop call

* ruff format

---------

Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>